Computing Similarity between Cultural Heritage Items using Multimodal Features
نویسندگان
چکیده
In recent years a vast amount of Cultural Heritage (CH) artefacts have been digitised and made available on-line. For example, the Louvre and the British Museum provide information about exhibits on their web pages1. In addition, information is also available via sites that aggregate CH information from multiple resources. A typical example is Europeana2, a web-portal to collections from several European institutions that provides access to over 20 million items including paintings, films, books, archives and museum exhibits. However, online information about CH artefacts is often unstructured and varies by collection. This makes it difficult to identify information of interest in sites that aggregate information from multiple sources, such as Europeana, or to compare information across multiple collections (such as the Louvre and British Museum). These problems form a significant barrier to accessing the information available in these online collections. A first step towards improving access would be to identify similar items in collections. This could assist with several applications that are of interest to those working in CH including recommendation of interesting items (Wang et al., 2008), generation of virtual tours (Wang et al., 2009), visualisation of collections (Hornbaek and Hertzum, 2011) and exploratory search (Amin et al., 2008). Information in digital CH collections often includes multiple types of media such as text, images and audio. It seems likely that information from all of these types would help humans to identify similar items and that it could help to identify them automatically. In this work we make use of information from two media (text and images) to compute the similarity between items in a large collection of CH items. A data set was created by selecting 295 pairs of items from Europeana. Each item corresponds to a metadata record consisting of textual information together with a URI and a link to its thumbnail. Figure 1 shows an item taken from the Europeana website. These are extracted and concatenated to form the textual information associated with each item. In addition, the accompanying thumbnail image (or “preview”) was also extracted to be used as the visual information. The size of these images varies from 7,000 to 10,000 pixels. A range of similarity measures for text and images are compared and combined. Text similarity is measured by applying a range of corpus-based methods, i.e. tf.idf and LDA (Blei et al., 2003). Image similarity is computed by making use of colour histograms and the Haar wavelet decomposition (Beylkin et al., 1991) to create signatures of images that contain colour and basic shape information. Evaluation is carried out using a set of items from Europeana with similarity judgements that were obtained in a crowdsourcing experiment. We find that combining information from both media produces better results than when either is used alone. http://www.louvre.fr/, http://www.britishmuseum.org/ http://www.europeana.eu
منابع مشابه
Pattern-Based Retrieval of Cultural Heritage Images
Conventional Content-Based Image Retrieval (CBIR) systems make use of similarity measures estimated directly from low-level image features, involving multidimensional and exhaustive, nearest neighbor searching. In this paper we present an image retrieval methodology suited for efficient search in cultural heritage images that utilizes similarity measures defined over higher-level patterns assoc...
متن کاملEstimating the Relevance of Search Results in the Culture-Web: A Study of Semantic Distance Measures
More and more cultural heritage institutions publish their collections, vocabularies and metadata on the Web. The resulting Web of linked cultural data opens up exciting new possibilities for exploring these cultural heritage collections. We report on ongoing work in which we investigate the estimation of relevance in this Web of Culture. We study existing measures of semantic distance and how ...
متن کاملA Dataset for Multimodal Question Answering in the Cultural Heritage Domain
Multimodal question answering in the cultural heritage domain allows visitors to museums, landmarks or other sites to ask questions in a more natural way. This in turn provides better user experiences. In this paper, we propose the construction of a golden standard dataset dedicated to aiding research into multimodal question answering in the cultural heritage domain. The dataset, soon to be re...
متن کاملBUT-TYPED: Using domain knowledge for computing typed similarity
This paper deals with knowledge-based text processing which aims at an intuitive notion of textual similarity. Entities and relations relevant for a particular domain are identified and disambiguated by means of semi-supervised machine learning techniques and resulting annotations are applied for computing typedsimilarity of individual texts. The work described in this paper particularly shows ...
متن کاملGrieser, Karl, Timothy Baldwin, Fabian Bohnert and Liz Sonenberg (2011) Using Ontological and Document Similarity to Estimate Museum Exhibit Relatedness, ACM Journal of Computing and Cultural Heritage 3(3), pp. 1-20
Exhibits within Cultural Heritage collections such as museums and art galleries are arranged by experts with intimate knowledge of the domain, but there may exist connections between individual exhibits that are not evident in this representation. For example, the visitors to such a space may have their own opinions on how exhibits relate to one another. In this paper, we explore the possibilit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012